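Some categorical features have a lot of different values, many of which appear with very low frequency. This is often a problem during the modeling phase: rare categories give a model too few examples to learn from, and encoding them (e.g. one-hot) inflates the number of columns.
A typical solution is to merge all categories whose frequency of appearance in the data set falls below a small threshold (e.g. 5%, 1%, etc.) into one category, e.g. "Other".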
Below is the pandas example on the sweet data set 🙂
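A minimal sketch of this approach, assuming a toy `sweets` series made up for illustration. The names `merge_rare_categories`, `threshold`, and `other_label` are hypothetical helpers, not part of pandas; only `value_counts`, `isin`, and `where` are real pandas calls.

```python
import pandas as pd


def merge_rare_categories(series, threshold=0.05, other_label="Other"):
    """Replace categories whose relative frequency is below `threshold`.

    Hypothetical helper for illustration; missing values are left untouched.
    """
    # Relative frequency of each category (NaN is excluded by default)
    freq = series.value_counts(normalize=True)
    # Categories that appear less often than the threshold
    rare = freq[freq < threshold].index
    # Keep values that are not rare, replace the rest with `other_label`
    return series.where(~series.isin(rare), other_label)


# Toy data (assumed for this example): 100 rows, 5 categories
sweets = pd.Series(
    ["chocolate"] * 50
    + ["candy"] * 40
    + ["lollipop"] * 6
    + ["toffee"] * 3
    + ["nougat"] * 1,
    name="sweet",
)

merged = merge_rare_categories(sweets, threshold=0.05)
print(merged.value_counts())
```

With a 5% threshold, `toffee` (3%) and `nougat` (1%) are merged into "Other", while `lollipop` (6%) is kept. The threshold is a judgment call and is best chosen by looking at the frequency table first.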